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Abstract The cosmological potential of large-scale structure observations for cos¬ 
mology have been extensively discussed in the litterature. In particular, it has 
recently been shown how Suiiyaev-Zel’dovich (SZ) cluster surveys can be used to 
constrain dark energy parameters. In this paper, we study whether selection and 
systematics effects will limit future wide-field SZ surveys from achieving their cos¬ 
mological potential. For this p urpose, we use a sky simulation and an SZ-cluster 
detection software presented in ( Pires et ali200R 1. using the future Olirnpo, APEX 
and Planck surveys as a concrete examples. We show that the SZ-cluster selection 
function and contamination of SZ-cluster catalogues are more complex than is 
usually assumed. In particular, the simulated field-to-field detected cluster counts 
is a factor 3 larger than the expected Poisson fluctuations. We also study the im¬ 
pact of missing redshift information and of the uncertainty of the scaling relations 
for low mass clusters. We quantify, through hypothesis tests, how near-future SZ 
experiments can be used to discriminate between different structure formation 
models. Using a maximum likelihood approach, we then study the impact of these 
systematics on the joint measurement of cosmological models and of cluster scaling 
relations. 
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1. Introduction 


In the next few years, a new generation of dedicated instruments based on 


bolometer cameras (APEX 1 , ACT 2 , BOLOCAM 3 , OLIMPO Magi et al 


large -array 


20031. and 


improved interferometers (AMI 4 , AMiBA 5 , SZA 6 ) will provide large amount of infor¬ 
mation on cosmic structure formation and evolution, and thus on cosmological mod¬ 


els. The Planck satellite I Lamarre et al 


200 41. to be launched in 2007, will provide a 


full-sky catalogue of galaxy cluste rs detected by their Sunyaev-ZePdovich (SZ) signal 


(Sunvaev & Zel’dovicb 


Midi 


1972). The potential of SZ observations results from the 


properties of the SZ effect: the lack of surface dimming and the ’’clean” measurement of 
thermal energy of the cluster gas, should afford a measure of the cluster mass function up 
to high redshift, with reduced systematics when combined with X-Ray observations and 
weak lensing surveys. The distribution of cluster abundance with redshift is sensitive to 
the cosmological paramet ers gs and Om. and also to a lower extend to Oa and the dark 


energ y 


2(M|). 


equation-of-state IIBarbosa et al 


1996; 


Oukbir fc Blanchar 


1 


1997 


Haiman et al 


Battve & Well er 2003) studied the dependence of these cos mological constraints 


on large scale structure formation and gas cluster physics models. 


Melin et al 


(2005) 


presented a first study of the selection function of large SZ-cluster survey. 

In this paper, we use simulations of the sky and of an SZ exp eriment, along with 
a recent cluster detection pipeline presented in 


Pires et al 



120051) . to simulate future 


large-array bolometer observations and cluster detections. We first discuss photometric 
issues, and then present a detected cluster catalogue, with its selection function, and 
purity curves. Those are found to be complex. In particular, the contamination of the 
cluster catalogue is quantified as a function of cluster brightness. We also compute the 
count variance in the observed catalogues. We then assume that the observations and 
cluster detection can be statistically described by an observation model. This observa¬ 
tion model allows us to transform a semi-analytic cosmology-motivated cluster count 


1 http://bolo.berkeley.edu/apexsz 

2 http://www.hep.upenn.edu/~angelica/act/act.html 

3 http://astro.caltech.edu/~lgg/bolocam_front.htm 

4 http://www.mrao.cam.ac.uk/telescopes/ami/index.html 

5 http://www.asiaa.sinica.edu.tw/amiba 

6 http://astro.uchicago.edu/sze 
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functions, dzd Y( z y) » a set °f probability density functions (pdf) of the detected 
cluster observed parameters Nobs, Yobs > zobs and of the contaminants Ncont and Ycont , 
where Y is the Compton parameter integrated over the cluster angular size, and z the 
redsliift. This observation model is found to be accurate enough given the statistics of 
the upcoming SZ cluster surveys and to be computationally very effective. Based on this 
model, we then discuss our ability to constraint cosmology assumptions and parameters. 
We show, using an hypothesis test method, how future SZ surveys will make it possible 
to distinuigh between several mass functions. We quantify the constraint that such an 
experiment would place on the effective “heating” parameter T*, using the cosmological 
parameters values measured by WMAP to break degeneracies. We conclude by show¬ 
ing how any conclusions on cosmological parameters are sensitive to inaccuracies in the 
observation model. 

Future large-array bolometer surveys share some common features. They observe the 
sky in several frequency bands to facilitate astronomical source separation. They use 
large bolometer matrices to maximise redundancy of observation on the sky and speed 
up field coverage. They are ambitious in terms of mission length, given the technology. 
In this paper, we mostly use the Olimpo project as a concrete example of an upcoming 
bolometer-based large SZ survey. 


2. Montecarlo simulations 


In the following, we use the sky simu lation software, t he instrument model and the 


cluster detection pipeline described in (Pires et al 


200 51. We briefly summarise it here 


for convenience. We simulated four astrophysical contributions to the sky map: primordial 
CM B anisotr opies (excluding the dipole), bright infrared galaxies as observed by SCUBA 


Borys et al 


2003), the infrared emission of the Galaxy and SZ-clusters. All simulations 


uses a cosmological model with parameters shown in tabled 

Figure |T| shows the distribution of the generated cluster as a function of redshift and 
integrated Compton flux. 


The frequency dependence of the bright infrared sources and of galactic dust are 
described by a grey-body spectrum. The spectral index of bright infrared sources is 
generated randomly for each sources between 1.5 and 2. Table Q provide the noise level 
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Table 1. Cosmological and gas physics parameters used in the simulations. Densities 
relative to critical density are labelled Q. Qtot is the density of universe, all components 
included, f lb is the baryon density, Da the vacuum energy density, SIdm the dark matter 
density, h is the reduced Hubble constant, n s the spectral index of primordial density 
power spectrum, ug the rms density fluctuations in spheres of 8 MPc diameter, fMass the 
mass function used in cluster abundance computations, T* the cluster mass-temperature 
normalisation factor, and f g the cluster gas mass fraction. The double vertical line des- 
tinguishes the primordial cosmological parameters, from the ingredients of the structure 
formation semi-analytic model. 
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Figure 1 . Generated cluster distribution as a function of redshift and integrated 
Compton flux. 


and the FWHM of antenna lobe at each frequencies, which is assumed be Gaussian. 
Bandwidth filters are assumed have a top hat response. 

Figure |21 shows the ‘observed’ maps, simulated using the Olimpo pa rameters. W e then 
apply an Independent Component Analysis method named JADE I CardosollflOfM on our 


map, after a wavelet transform. JADE separates the SZ signal from the other astrophys- 
ical sources effectively, as long as the noise level is kept low enough, and provid es a noisy 
SZ map. We apply then a Multiscale Entropy False Discovery Rate filter I Starck et al. 
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Figure 2. Simulations of sky maps, as observed by a large-array bolometer experiment. 
For these simulations, we used the Olirnpo experiment model. From Left to right, 143, 
217, 385, and 600 GHz bands are shown. In the two lower frequencies band, CMB primor¬ 
dial anisotropies are the dominant features. At higher frequencies bands, bright Infrared 
galaxies and Galactic dust become dominant. The SZ cluster signal is sub-dominant at 
all frequencies. 


Observation bands v [GHz] 

143 

217 

385 

600 

Bandwidth A v [GHz] 

30 

30 

30 

30 

Bolometer Number 

19 

37 

37 

37 

Lobe Width, FWHM [ arcmin] 

3 

2 

2 

2 

Noise level [/tKcmbs 1 ^ 2 ] 

150 

200 

500 

5000 


Table 2. Foreseen experimental features of the Olirnpo balloon bolometer project. 


2005) on the map and detect sources using the SExtractor software (Bertin & Arnouti 


1996 1. Detected sources can be reliably associated with simulated clusters and are labelled 


as ’true’ clusters. This allows us to compute the selection function and the photometric 
accuracy of our simulated observations. Detections that are not associated with simulated 
clusters are identified as contaminants. We can then calculate the purity of our recovered 
sample and the brightness distribution of the contaminants. 


2.1. Photometry 

Selecting the sources associated with simulated clusters, we plot in figure [3] the observed 
cluster flux Yobs versus the true simulated flux Y th . and derive our photometry model, 
i.e. the probability density function pdf(Fof, s \Yth)- The observed flux is strongly over- 
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estimated at low brightness due to the Malmquist-Eddington bias (Malmquist 


192C). 


Our first attempt for a statistical model reproduces very well the simulated photometric 
behaviour, except for the (small) non-Gaussian tails. 
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Figure 3. Left: cluster reconstructed flux versus the true simulated flux, and our pho¬ 
tometry model contours. 20 cumulative Monte-Carlo simulations where used for this plot. 
Right: SZ cluster reconstructed virial size versus true simulated virial size. No correlation 
is seen. 


2.2. Cluster Size reconstruction 

One way to infer the redshift of a cluster would be to measure its virial radius. Figure 0 
plots the reconstructed clusters virial radius versus their true (simulated) virial radius. 
One can hardly see a significant correlation between the simulated virial radius and the 
observed virial radius. This is the reason why we decided to neglect this information in 
the following work. 
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Figure 4. Completness as a function of redshift, flux (left) and mass (middle), as sim¬ 
ulated from a semi-analytic large scale structure and cosmology model. We used design 
parameters of the Olirnpo project to model observation performance. Right: modelled 
selection function after extended simulations. 

2.3. Completeness 

From the true cluster catalogue, we computed the cluster detection probability as a 
function of cluster integrated fluxes, redshifts and masses. Figures 0 left and middle, 
show the results. We see that a selection function can not be taken as a simple cut 
in total flux, nor in mass. We also notice that clusters at large redshift are detected, 
even though very few are predicted by the cosmological model. To quantify the selection 
function at high redshift, we therefore introduced “by hand” in our simulated map, 
10 % additional high-z clusters, randomly generated in the guessed Y-threshold area. 
We averaged 100 Monte-Carlos and we obtained the completness map plotted at figure 
01 right. The selection function reduces to a simple Y sensitivity curve at large redshift, 
when cluster sizes become smaller than angular resolution. But at redshift below 1, where 
we expect to detect most of the clusters, the completness curve is strongly distorted 
toward high Compton flux. For convenience, we provide in Annex0the tabulated values 
of Olirnpo selection function versus cluster mass and redshift. 

2.4. Purity and contamination 

Future SZ-cluster experiments won’t be able to easily sort the contamination from the 
true clusters. Our evaluation of the observed flux distribution of contaminants is done by 








SZ clusters selection function 
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Figure 5. We ran 100 montecarlo on 400 deg 2 . Left: the black curve is the histogram 
of generated cluster flux, compared to the the blue histogram of true cluster detection. 
Right: the blue histogram is the true cluster observed flux. The flux distribution of the 
contamination is plotted in orange. The red curve is our modelled flux distribution of 
contamination. 


selecting sources that are not associated with simulated clusters, and is shown at figure 
0 The contamination histogram provides the red curve which we use as our modelled 
flux distribution of contamination. An integration over histograms shown in igure[5]lead 
to a sample purity value of 91%, tuned by choosing the detection threshold. 


2.5. Sources counts 

Cluster counts provides powerful information for large scale structure physics and cos¬ 
mology. If the counts are dominated by field-to-field variations, one would expect them 
to follow a Poisson distribution. Figure 0 shows the histograms of cluster, contamination 
and source counts for 100 simulated fields. We notice that all the histograms show non- 
Gaussian tails. A closer look shows that this happen when the confusion of few bright 
clusters, bias our SZ-map normalisation method and lead to a low value for the cluster 
detection threshold. As a result, the contamination and thus the total cluster counts 
increases. This simulation-only artifact will be solved in the future development of the 
detection pipeline. 
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We also notice that the peak of the distributions are well fitted by Gaussian curves. 
By computing the peak FWHM, we note a factor of 3 excess widths, relative to Poisson’s 
distribution expectation. The origin of this excess is not yet settled. Possible explanations 
are other unidentified systematic effect in the cluster detection pipeline, or confusion due 
to the few large clusters, that mask smaller (but above threshold) cluster and thus induces 
lower statistic processes in the count variance. If it can not be avoided, cosmological 
constraints deduced from future observations will therefore have to take this effect into 
account. In the following we will use fits to the cluster and contamination counts (red 
curves in figure EJ to construct our observation model. 
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Figure 6. From left to right: true clusters, contamination and sources counts his¬ 
tograms for 100 simulations. Red curve fits are used in the following observations’ model. 


3. Observation model 

Our first goal in building an observation model is to identify and understand systematic 
effects in large-array bolometer surveys, relevant to cluster detection and cosmology. The 
second is to avoid to run a full Monte-Carlo chain to generate source catalogue observed 
by SZ-survey, a time consuming step in an analysis software that limits the number of 
possible iterations in partice. This is a strong assumption, that we checked up to the 
precision of the statistical uncertainties of upcoming surveys. 
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3.1. Observation model ingredients 

Semi-analytic LSS model provides the expected number N c i US Th of cluster of flux above 
a chosen threshold Yrhres and the cluster probability density function pdf(Yth, zth )• The 
observation model includes: 

A 2D selection function Se\(Y t h, zth ) giving the probability of a cluster of flux Y t h 
and redshift z t h to be detected. This 2D function is calibrated on simulation as described 
at paragraph 12.31 The expected number of true detected cluster can then be computed 
by integration of 



(1) 


Detection count variance model. In order to take into account the excess on true 
cluster counts described above, we introduce the pdf of observing N obs clusters, given 
NobsTh, pdf(lV 0 b s | N 0 bsTh)- We simply assume that it follows a gaussian law, with a 


width a = A clus y/N 0 b S Th, with A clus fitted to the value 3.0, as explained at paragraph 

ESI 


A photometric model pdf(Y 0 f,. s , z 0 b s \ Y t h, Zth ) that provides the pdf of observing 
Y 0 b s and z obs given Y th and z th . 

A contamination model: we assume that contamination are driven by the foreground 
models and that these do not depend on cosmological parameters. The pdf of the ob¬ 
served flux of contaminants pdf (Ycont) is deduced from the Monte-Carlo simulations of 
paragraph 12.41 after normalisation. The expectation of the contaminant counts and its 
variance are taken to be constant and fitted from the distribution shown in figure El 

An error model on the redshift determinations, when such a complementary mea¬ 
surement is available: pdf(z 0 6 S | Zth)- 

3.2. Simplifying assumptions 

The above components of the observation model have been derived from the simulations, 
given instruments parameters and for the concordance cosmological model. They have 
been shown to be very sensitive to experiment properties such as noise level, number of 
observation bands, etc. When constraining cosmological parameters, these experimental 
effects are expected to be under control. On the other hand, we assume that the obser- 
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Figure 7. Left: The cluster distribution generated by simulations, d ^ Y ( z th, Yth ), and 
the observed cluster distribution, d ^° dYh ( z th, Yth), in 100 cumulative Monte-Carlo sim¬ 
ulations (middle), and from the observation model (right). The axes are the integrated 
flux Y in arcmin 2 versus redshift. 


vation model is not sensitive to the cosmological parameters, when these are reasonably 
close to our reference model. This assumption is strong and not obvious, since in large- 
array bolometers survey, the contribution of source confusion to the photometric noise 
may not be negligible. We checked the validity of this assumption, all other parame¬ 
ters being kept constant, by changing the cluster map density by a factor 1.5 and 0.75. 
Both recovered observation models were compatible with the above model, except for a 
small increase in the width of the photometry curve (paragraph l2.Hl in the large density 
option. We assumed this to be acceptable since, would such a dramatic discrepancy of 
cluster density be observed, we would recalibrate our observation model on representative 
simulations. 

Thus given a cosmological model and observations’ model, we derive a set probabil¬ 
ity density function describing our observations: pdf(iVo6s)j pdf(y o 6s> z 0 b s ), pdf(Y’cont), 
pdf (Ncont)- Figure 0 shows the distributions of flux and redshifts of detected clusters, 
generated by full Monte-Carlo and from our observation model. Those are remarkably 
similar, thus confirming the validity of our observation model. 

We also tested whether the use of the observation model would bias the cosmolog¬ 
ical parameter estimation. For this purpose, we computed 100 full Monte-Carlo source 
catalogues. For each catalogue, we computed the cosmological parameters using our ob- 
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Figure 8. Probability density of the recovered cosmological parameters as and Hm, for 
100 Monte-Carlo simulations and using the observation model. Diamond is the model 
used at the input of simulations. Cross, is the maximun of occurrence of reconstructed 
parameters. 

servations’ model. Figure 0 show the surface density of two of the fitted parameters as 
and mostly relevant for this study. We observe that the input cosmological param¬ 
eters are well within the 68% CL contour of the distribution. Thus, the bias induced by 
the observation model is small compared to the statistical error of the observations. 

We conclude that the use of an observation model is legitimate given the accuracy 
of upcoming experiments. This observation model will be improved: taking into account 
non gaussian tails in our photometry model is the main improvement foreseen. In the 
following, all source catalogues have been generated using the observation model. 

4. Cosmological implications 

The mains physics goals of large SZ-cluster surveys are to learn more about cluster gas 
physics, large scale structure, and cosmology. These physical models are parametrised, 
and involve assumptions that can be tested by future SZ cluster surveys. In the following, 
we first present statistical tools and results testing the compatibility of our data, with 
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a parametrised model family in paragraph hypothesis tests. Then we show how SZ clus¬ 
ter data can constraint the mass temperature normalisation factor T*, using a classical 
parameter estimation method. Assuming then T* known, we explore the potential of SZ 
cluster survey, for constraining cosmological parameters !!,w and ag. We conclude by 
showing the effect on cosmological parameters of oversimplifying features of the obser¬ 
vations’ model. In the following we assumed we have available a catalogue of observed 
sources corresponding to a nominal Olirnpo scientific flight: 500 sources observed over 
300 square degrees. 


4.1. Extended likelihood 


The tool for all the following statistical tests is the so called extended likelihood of the 
cosmological parameters C, given the experiment exp: L(C\exp). 


dP 


N<- 


N? 


L ( C \ ex P) = 7^s^( NS ° Ur -’ C ) n pdf(z? lus , Y f lus ; c ) n Pdf(Y?-;C) (2) 


dN s ' 

with N Sour = N cius + N? our 


The likelihood incoporates three kinds of information available in the data.The first factor 
is the probability of observing N Sour sources given the cosmological parameters C. The 
second factor is the probability of observing a cluster with a flux Y and at redshift z 
(using follow-up observations). We assume that the follow-up observation established 
whether the source is a cluster of galaxies, or a false detection. In the latter case, this 
source is excluded from the likelihood, except from the first factor. The third factor is the 
probability of observing a source of flux Y, when no follow-up observations were available. 
In this case, we do not know whether this source is a SZ-cluster or a false detection. Our 
observation model (paragraph [3]) provides these three factors in the likelihood, either 
directly or after integration and normalisation of the distributions. 


4.2. Hypothesis tests 

The question we wish to answer before constraining parameters models is whether there 
exist a parametrised model which is compatible with our data. To settle this issue, we use 
an hypothesis test method. For a cosmological model, we generate by Monte-Carlo a large 
number of observed source catalogues, compute their likelihood for the given cosmological 
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300 


Figure 9. Left: histogram of log-likelihood L (black) for N Monte-Carlo catalogue of a 
Press-Schechter cosmological model. The peak is fitted by a gaussian law (red line), with 

mean Lrnean- 

Right: red line is the probability versus A L = L — L mean of observing a Press-Schechter 
based catalogue with A L. Vertical dashed line is the A L computed for a catalogue 
generated from a Sheth and Tormen model. The probability of compatibility is lower 
than 10" 5 . 


model and build an histogram of the likelihood (see figure |9j. The normalisation of the 
integral of the histogram provides the probability curve of an observed catalogue to be 
compatible with the cosmological model. The use of the statistical observation model 
speeds up this work dramatically. Figure 0 left, shows the histogram of the likelihoods 
compute d ass umin g the co ncordance cosmological model and a Press and Schecliter mass 


function (Press & Schechter 


19741) for the clusters. The black line shows the likelihood 


value computed for a source catalog ue computed with the same cosmology, but with the 
mass function of Sheth and Tormen ( Sheth et ahlboOl ). The probability of compatibility 
is lower than 10 -5 . The Press and Schechter hypothesis is thus rejected by the data. In 
practise, cosmological parameters are free parameters and we often obtain a compatibility 
valley for our parameters with our “observed” data. Other sources of constraints on 


400 
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cosmological parameters (such as CMB anisotropies) will allow us to select the relevant 
cosmological models and mass function. 



Figure 10. Left: histogram of difference of log-likelihood A (black) for N Monte-Carlo 
catalogues of the best cosmological model C m in according to our data set. Dashed blue 
line is the y 2 law, expected for Gaussian distributions with 2 degrees of freedom. 68 % 
and 95% confidence levels are shown as dashed black and red lines. The \ 2 approximation 
is very optimistic. 


4.3. Parameter Estimation 

Once we have selected a parametrised model compatible with out data, the next question 
is to estimate a set of best cosmological parameters, in agreement with data and to 
compute the associated errors (or confidence levels). For this purpose, we minimise the 
function —lnL{C |D) over C, vector of the model parameters, to find the best model 
according to our data C m in and his likelihood L m i n . Then, we generate, according to 
Cmin , many source catalogues Cj, minimise the likelihood to find the best Model Ci 
matching Ci, and build the histogram of 

Ai = — 


2 [lnL(D\Ci) - ln(L min )] 


( 3 ) 
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and the map of A, at position C^. Computing the normalised cumulative distribution 
of the variable A ('figure fTTl allows us to compute A,; values of the 68%, 95% and 99% 
confidence level limits to be used to draw contours on the model map. Would the pdf 
be gaussian distributed, A,; distribution would follow a x 2 law. We notice that cluster 
density versus redshift and flux are not gaussian. 
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Figure 11. Expected constraints on a g and from an Olimpo scientific flight, with 
full spectroscopic follow-up of the sources. All other cosmological parameters, have been 
set to the values in table |T| These constraints are severely degraded due by the excess 
count variance observed in the Monte-Carlo simulations. 



4.4. Constraints on cosmological parameters for the Olimpo SZ-cluster surveys 


We now use the above tools to constraint the cosmological parameters. In the following, 
we will assume a ACDM cosmological model with parameter list of table |T| The most 
important parameters for large scale structure formation and SZ-clusters are_(7g_and 


jrs well as T*, the normalisation of the mass to temperature scale relations I Pieroaoli et al 


200 11 in cluster formation models. We plot in figure^2th e expected constraints on ct 8 and 
CLm from SZ-cluster observations, assuming all other cosmological parameters known, at 
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their simulated values. We assumed that follow-up observations provided redshifts for all 
the sources. This is the best constraint achievable, according to our observation model. 

We observed that these constraints are rather pessimistic, given the ambition of the 
SZ-cluster projects. This is due to the excess width in the count variance, observed in 
our Monte-Carlo simulations, equivalent to a statistic loss of a factor 10. 



0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 

Cm 0 m 


Figure 12. Degradations of constraint due to the cluster count variance. Left: Colored 
are the Dm vs a s Confidence Level contours computed using the observed Monte-Carlo 
count variance, all other cosmological parameters set to their simulated values. Only the 
information on SZ-cluster count has been used in this figure, no redshift. Dashed are the 
same CL constraints, assuming a Poisson-law field to field cluster count variation. 
Right: Dm cr 8 Confidence Level contours assuming 100% follow-up for cluster redshifts, 
assuming a Poisson-law field to field cluster count variation, all other cosmological pa¬ 
rameters set to simulated values. Colored contours are a copy of figure ITT 


4.4.1. Cluster count variance 

As shown at paragraph s. 51 the simulated source count variance is larger than a Poisson’s 
distribution of same expectation that we would assume from field to field variance. If we 
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now forget the simulation results, the cluster count carries richer cosmological information 
which is quantified in the first term of the likelihood. Figure IT21 shows the constraints 
computed with this optimistic assumption. We have not been able to decide yet where 
does the count variance comes from. First suspect is the cluster detection software, with 
an out of control feature. Second is the confusion due to the few large clusters, that mask 
smaller cluster and thus induces lower statistic processes in the count variance. Third 
suspect is the effect of one of the foreground. Therefore cluster detection algorithms 


for cosmology, ( Herranz et al 


2002 ; 


Pieroaoli et al 


2005; 


Melin 


2004; 


Pires et al 


200511 . 


should be evaluated, on their efficiency, on the purity of the recovered source catalogue, 
but definitely on the source count variance at the output of the chain. We are working 
on this issue. Solving/minimising this effect is now our first priority. 
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Figure 13. Left: confidence level map, on as and T*, marginalised on Om- Colors are 
computed from SZ-cluster data only, dashed lines uses WMAP and CFHTLS weak-shear 
Fisher matrix constraints, no systematic effect are included. 

Right: Lines are the constraints on cosmological parameters if we keep only the largest 
flux clusters. All other cosmological parameters, have been set. Diamond is the generated 
concordance model. White cross is the reconstructed model. Colors delimit the references 
CL contour. Lower statistic induces heavy loss in the constraint accuracy. 
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4.4.2. Degeneracy with late cluster physics 

Galactic physics event heats the extragalactic cluster gas, and thus contribute to the 
SZ-cluster signal in addition to the gravitational potential and virialisation. Late cluster 
gas heating mechanisms are not well known yet. Their contribution to gas heating is 
commonly parametrised in the mass-temperature relation as a normalisation factor, T*. 
Figure EH shows the CL contours placed on T* and as marginalised on f Im- We observe 
that with the input of WMAP and CFHT-LS weak-shear forecast, the residual correlation 
between T* and as is small. In addition on going X-Rays surveys from XMM satellite 
should provide a lot of information on cluster gas physics and allow precise determination 
of T*. Thus in the following we set T* to the value 1.9. 

4.4.3. Restriction to high-mass clusters 

Large clusters involve hundred of galaxies. Their gravitational potential is stronger than 
in smaller clusters and non-gravitational physics is less important than in low mass 
systems. As a result, their mass to temperature scaling law is expected to show a smaller 
dispersion. Thus one can foresee that heavy clusters will be statistically better modelled, 
and that constraints based on massive cluster observations will be reliable. It is thus 
instructive to study the cosmological constraints which can be derived from a sample 
restricted to high-mass clusters. Figiire lTdl shows the confidence level map computed from 
a catalogue, when we select clusters of flux larger than 7.4 x 10~ 4 arcmin 2 . The CL contour 
are significantly enlarged compared to the reference contour drawn in black, because of 
the much smaller statistic. This is a strong motivation for theorists to understand and 
build a model of low-mass clusters. 

4.4.4. Incomplete spectroscopic follow-up 

Large-array bolometer surveys will provide large cluster catalogues, including flux and 
positions and shapes for resolved clusters, but have to rely on follow-up experiments for 
redsliift measurements. Figure ITU show the impact of partial redshift coverage, assuming 
10% or 50% random coverage of clusters candidates. Remaining catalogue contamina¬ 
tions have been properly taken into account. We note that no significant bias on the CL 
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Figure 14. Left: impact on cosmological constraints, due to an incomplete redshift 
follow-up of cluster candidates. Black line is the 68 % CL contour assuming a 10% 
coverage follow-up. Dashed line assumes 50% coverage follow-up. Colored contours are a 
copy of figure fill 

Right: Lines show the systematic shift in the CL contour induced by neglecting con¬ 
taminants in the recovered source catalogue. This plot was generated assuming that 50% 
of the sources have been observed in follow-up for redshift. Colors stand for contours 
computed with the same dataset, but taking into account contaminations. 

contours is seen, but that the statistics are degraded. This shows that follow-up obser¬ 
vations will be very important for the accuracy of the physics output of large SZ-Cluster 
surveys. 

4.4.5. Neglecting contamination 

Assuming now that the redshift followup of the observations will be incomplete (as is 
very likely to be the case in practice), we quantify now the effect of neglecting con¬ 
tamination in the recovered catalogue. We assume in the paragraph that 50% of the 
sources have a redshift. Technically, this means using in the first factor of the likelihood 
the pdf count of true detected cluster and in the third factor, the integral over z 0 bs of 
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dz d B dY ob M i lUS ' Y i !lUS ' C )- Figured shows the results. We see that since we assumed 
that false detections are clusters, the reconstructed parameters are biased toward large 
erg j since contamination produce a spurious enhancement of the cluster distribution at 
low surface brightnesses. The reduced size of the CL contour is a secondary effect of the 
low Qm fitted value. 


5. Conclusion 


In this paper, we have explored, in details, the potential and limitations of upcoming 
wide-held SZ surveys. We used a full simulation pipeline, going from the cosmological 
models to recovered cluster catalogues and constraints on cosmological parameters. We 
showed that the selection function and purity of the recovered catalogues are more com¬ 
plex than is usually assumed. We quantified the foreseen selection function, photometry 
and contamination of the upcoming Olimpo project. 

We observed a cluster count variance larger than Poisson’s width, when running several 
Monte-Carlo simulations. This effect worsens the cosmological constraints. We did not 
identify yet the origin of this excess width, but reducing it is our first development pri¬ 
ority in order to achieve the full potential of SZ-surveys for cosmology. 

We presented methods to statistically model the observations, select parametrised models 
compatible with data, and then constraint models parameters. We showed that, using SZ 
Cluster data, combined with WMAP and expected CFHTLS weak-shear forecast data, 
little correlation is seen between efficiently the mass-temperature normalisation factor T* 
and <78- Complementary X-Rays observations will be necessary to put tighter constraints 
on T*, but on the other hand, we only need moderate precision on T*, to achieve good 
knowledge on as- We finally exposed the impact on cosmological parameters of system- 
atics in observations or interpretation of our data. 

Th is paper does not use the 2-point correlation of SZ Cluster to constraint cosmology 
( Mei X BartietJboO.I ). This incorporation of the 2-point correlation function in our sim¬ 
ulation pipeline and its impact on cosmological parameter is left to future work. 
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7. Annex: selection function data 

The following table 0 samples values of the selection function as a function of mass and 
Compton flux. 


Redshift 

90% efficiency flux [arcmiii 2 ] 

90% efficiency Mass [M sun ] 

0.1 

2.5 10“ 3 

2.9 10 14 

0.2 

1.5 10“ 3 

3.8 10 14 

0.3 

1.0 10" 3 

4.0 10 14 

0.4 

7.7 10“ 4 

4.5 10 14 

0.5 

6.5 10“ 4 

5.0 10 14 

0.7 

5.6 10" 4 

5.1 10 14 

1.0 

4.9 10 -4 

5.1 10 14 

2.0 

4.2 10" 4 

4.2 10 14 

3.0 

4.2 10" 4 

3.2 10 14 

4.0 

4.2 10" 4 

2.2 10 14 

5.0 

4.2 10" 4 

2.0 10 14 


Table 3. Selection function versus redshift. Value entered correspond to 90% efficiency. 
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